Screening method for gene variation

ABSTRACT

A method for screening presence or absence of a variation in a region of a nucleic acid which comprises the steps of (a) preparing a sample containing a test nucleic acid corresponding to the region, (b) preparing a probe having a base sequence fully complementary to a normal sequence of the region, and a plurality of probes each having at least one base not complementary to the normal sequence, (c) fixing the probes in separate regions on a surface of a substrate to prepare a DNA array substrate, (d) reacting the test nucleic acid with the probes on the DNA array substrate, (e) measuring signals in each region where the signals are originated from respective hybrids formed between the test nucleic acid and one of the probes, and (f) calling variation in the test nucleic acid using a pattern of total signals of all regions.

BACKGROUND OF INVENTION

[0001] 1. Field of the Invention

[0002] The present invention relates to a variation detection method and a system for it useful for screening of a gene variation etc.

[0003] 2. Related Background Art

[0004] One of the techniques for sequencing nucleic acid etc. or for detecting the sequence is to utilize a DNA array. U.S. Pat. No. 5,445,934 discloses a DNA array where 100,000 or more oligonucleotide probes are bonded in 1 inch square. Such a DNA array has an advantage that many characteristics can be examined at the same time with a very small sample amount. When a fluorescence-labeled sample is poured onto such a DNA chip, DNA fragments in the sample bind to probes having a complementary sequence fixed on the DNA chip, and only that part can be discriminated by fluorescence to elucidate the sequence of the DNA fragment in the DNA sample.

[0005] Sequencing By Hybridization (SBH) is a method for examining the base sequence utilizing such a DNA array and the details are described in U.S. Pat. No. 5,202,231. In the SBH method, all possible sequences of an oligonucleotide of a certain length are arranged on the substrate, then fully matched hybrids formed by hybridization reaction between probes and the sample DNA are detected. If a set of fully matched hybrids is obtained, the set will give an assembly of overlapping sequences with one base shift being a part of one certain sequence, which sequence is extracted for calling.

[0006] Not only the SBH method, when complementariness between an oligonucleotide and a sample DNA is examined, it is very difficult to call whether a hybrid was formed or not using one probe for one test item, since the stability of a hybrid differs sequence to sequence, and there is no perfect signal for calling the full complementariness. Science vol. 274 p.610-614, 1996 discloses a method for calling comparing the signal intensity of a perfect match hybrid and the weaker intensities of one-base mismatch hybrids. In this method, 15-mer oligonucleotide probes differing from each other only by one mismatching base at the center of the sequence are prepared, and the fluorescence intensities of the hybrids of the probes are compared, and when the intensity of the full matched hybrid is stronger than that of other hybrids by a predetermined rate, it is called positive.

[0007] Further, U.S. Pat. No. 5,733,729 discloses a method using a computer to know a base sequence of a sample from a comparison of fluorescence intensities of obtained hybrids for more accurate calling.

[0008] However, actual binding strength of a hybrid depends on the GC content etc., and difference of the fluorescence intensity between a full match hybrid and a one-base mismatch hybrid also varies in a considerable range depending on the sequence. Thus, a method for calling whether a sequence is fully complementary to a probe or not, using a 15 mer oligonucleotide probe to compare it with other three probes having one mismatched base at the center thereof, can provide more accuracy if each stability is evaluated theoretically or empirically before comparison.

[0009] In addition, accurate calling requires precise quantification of signals, and therefore, precision apparatuses such as a confocal laser microscope. Furthermore, in order to measure the fluorescence intensity of a hybrid of every probe and to determine the gene sequence by analyzing the data, a large-scale computer apparatus as well as a detection apparatus for reading the arrays are further required. Therefore this is a big obstacle for ready use of the DNA array.

[0010] On the other hand, gene diagnosis using such a DNA array may be used in group medical examination, individual gene examination or gene-polymorphism study. In such a case, however, the above described precise measurement and analysis are not always required, where a large amount of samples are rapidly treated at a low cost in order to find out variated samples concerning a specific item from a large number of normal samples. Further, the precision apparatus and analysis as described above will cost much. Accordingly, a concept that screening of the presence or absence of a variation is first performed, and then, detailed examinations are carried out about the samples suspected of variation by the screening will save time and cost.

SUMMARY OF THE INVENTION

[0011] One object of the present invention is to provide a method suitable for mass screening so as to determine rapidly the presence or absence of a gene variation without need of an expensive apparatus and a complex analysis.

[0012] The present invention provides a DNA array in which a group of probes which will give strong signals forming hybrids with a normal gene sequence, and a group of probes having sequences expected to form hybrids with gene variants are separately arranged, on the premise that the base sequence of a normal gene and those of variants have already been established. Furthermore, the above described object is achieved by providing a detection method using such an array.

[0013] According to one aspect of the present invention, there is provided a method for screening of the presence or absence of variation in a region of a nucleic acid comprising the steps of:

[0014] (a) preparing a test nucleic acid corresponding to the region;

[0015] (b) preparing a probe having a base sequence fully complementary to a normal sequence of the region, and a plurality of probes each having at least one base not complementary to the normal sequence;

[0016] (c) fixing the probes in separate regions on a surface of a substrate to prepare a DNA array substrate;

[0017] (d) reacting the test nucleic acid with the probes on the DNA array substrate;

[0018] (e) measuring signals in each region totaly where the signals are originated from respective hybrids formed between the test nucleic acid and one of the probes; and

[0019] (f) determining the presence or absence of mutation in the test nucleic acid comparing with a histogram pattern of signals of all regions obtained using a normal sample without variation.

[0020] According to another aspect of the invention, there is provided a DNA array substrate for screening a variation in a region of a nucleic acid, wherein

[0021] a full match probe fully complementary to a normal sequence of the region, and a plurality of mismatch probes having at least one base mismatch to the sequence are arranged on the substrate; and

[0022] the probes are arranged to form at least two separate regions selected from:

[0023] a first region containing at least one probe which provides a signal of a certain intensity on reaction with a nucleic acid having the normal sequence,

[0024] a second region containing at least one probe which provides a weaker signal than the probe of the first region on reaction with a nucleic acid having normal sequence, and

[0025] the third region containing at least one probe which provides no signal on reaction with a nucleic acid having normal sequence.

[0026] According to still another aspect of the present invention, there is provided a system for detecting variation comprising a DNA array substrate as described above and a signal measuring apparatus which measures signals from separate regions of the DNA array substrate.

[0027] The present invention can provide a method suitable for mass screening, so as to rapidly determine only the presence or absence of a gene variant, without need of an expensive apparatus and complex analysis.

BRIEF DESCRIPTION OF THE DRAWINGS

[0028]FIG. 1A is a schematic example of arrangement of separate regions on a DNA array substrate, and

[0029]FIG. 1B shows a histogram of fluorescence intensities (pattern) corresponding to separate regions of the above arrangement, measured using a line sensor:

[0030]FIG. 2 shows an arrangement of 64 types of DNA probes;

[0031]FIG. 3 shows a pattern of the fluorescence intensities obtained by hybridization;

[0032]FIG. 4 shows normal fluorescence intensities obtained by hybridization where Gr. NO. denotes group number;

[0033]FIG. 5 is a distribution pattern of fluorescence intensities corresponding to the separate regions of the arrangement, measured using a line sensor;

[0034]FIG. 6 is a distribution pattern of fluorescence intensities corresponding to the separate regions of the arrangement, measured using a line sensor;

[0035]FIG. 7 is a distribution pattern of fluorescence intensities corresponding to the separate regions of the arrangement, measured using a line sensor;

[0036]FIG. 8 is a distribution pattern of fluorescence intensities corresponding to the separate regions of the arrangement, measured using a line sensor; and

[0037]FIG. 9 shows normal fluorescence intensities obtained by hybridization where Gr. NO. denotes group number.

DETAILED DESCRIPTION OF THE INVENTION

[0038] The present invention provides a screening method for gene variants using a DNA array in which a group of probes which will give strong signals forming hybrids with a normal gene sequence, and a group of probes having sequences expected to form hybrids with gene variants are separately arranged, on the premise that the base sequence of a normal gene and those of variants have already been established. Furthermore, the above described object is achieved by providing a detection method using such an array.

[0039] Here, the present invention will be described in detail with examples where signals from the separate regions are fluorescence. However, signals in the present invention are not limited to fluorescence but may be other light signals or electric signals.

[0040] Binding strength between single-stranded nucleic acids to form a hybrid is controlled by various factors, and when a probe having a length of about 12 mer to 25 mer is used, it is practically difficult to perfectly exclude hybrids having one-base mismatches.

[0041] When the signal of the hybrid to be detected is a light such as fluorescence, the following phenomena are observed. Fluorescence stability of a hybrid having two mismatches (two-base mismatch hybrids) is much lower than that of the one-base mismatched hybrid, regardless of positions, continuity or discontinuity of the mismatched bases. On the other hand, signal from a three-base mismatch hybrid is hardly observed. However, one-base mismatch hybrids may have more than 50% signal intensity of a full match hybrid. Thus, when a sample is a nucleic acid of normal sequence, strong fluorescence is observed in a region where a full match probe and probes having one mismatching base have been arranged, while the fluorescence intensity in a region where hybrids of low stability are formed is almost zero. On the other hand, when a sample is a nucleic acid of a variant sequence, the probe fully complementary to the normal sample makes a mismatch hybrids with the sample. Thus the fluorescence is weaker at a region than that in case of the normal sample, at the same time, fluorescence from a full match hybrid and one-base mismatch hybrids with the sample appears in another region where low signals are expected in case of the normal nucleic acid. Accordingly, by comparing fluorescence histogram of the regions, it is possible to distinguish between the normal nucleic acid and the variant nucleic acid.

[0042] In the present invention, a DNA array substrate where probes are arranged in separate regions according to the fluorescence intensities of their hybrids with a normal nucleic acid is used for more accurate calling. First, hybridization reaction of the normal nucleic acid with each probe is performed, and based on the fluorescence intensities of the hybrids obtained, separate regions each containing probes corresponding to strong fluorescence, no fluorescence, and moderate (weak) fluorescence are located at predetermined positions on the substrate.

[0043] When performing a test on multiple items simultaneously, the substrate should be divided into areas for respective items. In each area, probes having full match and one-base mismatch sequences to the normal gene sequence are arranged in a region, which are expected to give high signals, and the other probes having more than two-base mismatch are arranged in a separate region depending on the items. Thus we can discriminate normal test samples from variated ones.

[0044] The arrangement can be determined according to the type of the sensor to be used. For example, when a line sensor is used, each region is arranged from the left to right in the substrate in order of the strength of fluorescence obtainable by hybridization with the normal nucleic acid. Thus, the fluorescence intensity will be maximum in the left, then gradually decrease and come to zero in several regions in the right of the substrate. When an area sensor is used, it is necessary to evaluate the fluorescence quantity of at least two separate regions containing a group of probes which will provide the maximum fluorescence and a group of probes which will not provide any fluorescence respectively.

[0045] This will be described more specifically.

[0046] In the present invention, for example, whether a gene is normal or not can be called by providing a region containing a probe that forms a hybrid with a normal nucleic acid and the other region containing probes that form hybrids with variant genes separately on a DNA array, and taking a ratio of the signal from a hybrid corresponding to the normal nucleic acid and to the signals from hybrids corresponding to variant genes.

[0047] However, because one-base mismatched hybrids sometimes have strong signals, it is difficult to detect variation in the test sample when a DNA array where only probes being full match or one-base mismatch to a normal sequence are arranged is used. It is important that probes having two-base mismatch to the normal nucleic acid is present in the DNA array, which might be one-base mismatch probes to the test sample to form hybrids having strong signals.

[0048] Therefore, in the present invention, in order to judge more accurately, a preferable method is as follows. Probes having full match and one-base mismatch sequences to the normal nucleic acid sequence, which most samples in mass screening have, are arranged in a specified region on a DNA array substrate. In addition, probes having two- or three-base mismatch are arranged in a region different from the above region for high stability hybrids. Then, a hybridization reaction is performed with the normal nucleic acid or a sample, using the DNA array substrate of such an arrangement. Then the total of signals for each separate region is measured, and a pattern obtained with a sample nucleic acid is compared that with the normal nucleic acid are compared to determine the presence or absence of variation.

[0049] For example, when 64 probes (4×4×4=64) where different bases are arranged at three positions in 18 base length (Table 1, 5′-terminus is on the left) are used supposing that variation occurs at these three points, for any sample there should be present a probe fully complementary to the sample, nine one-base mismatch probes, 27 two-base mismatch probes and 27 three-base mismatch probes. Furthermore, it is relatively easy to set conditions of the hybridization reaction such that fluorescence is observed with the full match and one-base mismatch hybrids but not with three-base mismatch hybrids. Some two-base mismatch probes form hybrids and others not, depending to their sequences. TABLE 1 SEQ ID NO: Sequence 1 GATGGGACTCAAGTTCAT 2 GATGGGACTCAGGTTCAT 3 GATGGGACTCACGTTCAT 4 GATGGGACTCATGTTCAT 5 GATGGGACTCGAGTTCAT 6 GATGGGACTCGGGTTCAT 7 GATGGGACTCGCGTTCAT 8 GATGGGACTCGTGTTCAT 9 GATGGGACTCCAGTTCAT 10 GATGGGACTCCGGTTCAT 11 GATGGGACTCCCGTTCAT 12 GATGGGACTCCTGTTCAT 13 GATGGGACTCTAGTTCAT 14 GATGGGACTCTGGTTCAT 15 GATGGGACTCTCGTTCAT 16 GATGGGACTCTTGTTCAT 17 GATGGGGCTCAAGTTCAT 18 GATGGGGCTCAGGTTCAT 19 GATGGGGCTCACGTTCAT 20 GATGGGGCTCATGTTCAT 21 GATGGGGCTCGAGTTCAT 22 GATGGGGCTCGGGTTCAT 23 GATGGGGCTCGCGTTCAT 24 GATGGGGCTCGTGTTCAT 25 GATGGGGCTCCAGTTCAT 26 GATGGGGCTCCGGTTCAT 27 GATGGGGCTCCCGTTCAT 28 GATGGGGCTCCTGTTCAT 29 GATGGGGCTCTAGTTCAT 30 GATGGGGCTCTGGTTCAT 31 GATGGGGCTCTCGTTCAT 32 GATGGGGCTCTTGTTCAT 33 GATGGGCCTCAAGTTCAT 34 GATGGGCCTCAGGTTCAT 35 GATGGGCCTCACGTTCAT 36 GATGGGCCTCATGTTCAT 37 GATGGGCCTCGAGTTCAT 38 GATGGGCCTCGGGTTCAT 39 GATGGGCCTCGCGTTCAT 40 GATGGGCCTCGTGTTCAT 41 GATGGGCCTCCAGTTCAT 42 GATGGGCCTCCGGTTCAT 43 GATGGGCCTCCCGTTCAT 44 GATGGGCCTCCTGTTCAT 45 GATGGGCCTCTAGTTCAT 46 GATGGGCCTCTGGTTCAT 47 GATGGGCCTCTCGTTCAT 48 GATGGGCCTCTTGTTCAT 49 GATGGGTCTCAAGTTCAT 50 GATGGGTTCTAGGTTCAT 51 GATGGGTCTCACGTTCAT 52 GATGGGTCTCATGTTCAT 53 GATGGGTCTCGAGTTCAT 54 GATGGGTCTCGGGTTCAT 55 GATGGGTCTCGCGTTCAT 56 GATGGGTCTCGTGTTCAT 57 GATGGGTCTCCAGTTCAT 58 GATGGGTCTCCGGTTCAT 59 GATGGGTCTCCCGTTCAT 60 GATGGGTCTCCTGTTCAT 61 GATGGGTCTCTAGTTCAT 62 GATGGGTCTCTGGTTCAT 63 GATGGGTCTCTCGTTCAT 64 GATGGGTCTCTTGTTCAT

[0050] When these 64 probes are grouped into every eight probes in order of the fluorescence intensity obtained by hybridization with the normal nucleic acid (hereinafter “fluorescence intensity of a probe(s)” means expected inensity of a hybrid of the probe with a nucleic acid of normal sequence, if not otherwise stated), the total fluorescence intensity of the first group should be extremely high and the total fluorescence quantity of the sixth, seventh and eighth groups should be zero. Such classification by the fluorescence intensity may be performed empirically or theoretically through calculation.

[0051] Then, the hybridization reaction is performed in an actual system and the total quantity of fluorescence is determined for each classified group. Particularly, the fluorescence quantities of the sixth, seventh and eighth groups are important. Normally, fluorescence is not expected for these groups. When fluorescence is detected unexpectedly for these groups, it is understood that the sample is not normal but having variation at a part of the sequence. When the fluorescence is not detected for these groups, most of the sequences are normal.

[0052] For more accuracy, however, it is necessary to compare the fluorescence quantity of the first group with a normal case. When the fluorescence quantity is significantly lower than that expected for normal sequence, base variation is suspected. Further the histogram of the fluorescence intensity in the regions having medium fluorescence intensity, i.e. the second, third and fourth groups, should be compared with that of the normal one.

[0053] For more effective detection of variation, one may consider a method wherein the probes are arranged in order of signal intensity, that is, lined in the order of sequences of full match, one-base mismatch, two-base mismatch, and three-base mismatch to the normal sequence from one end, so that signal intensity may be zero in the region opposite to the region where the full match probe is placed. In this case, the total distribution pattern of the fluorescence intensity is examined. For example, when using an arrangement of the separate regions shown in FIG. 1A (one column corresponds to one separate region), a monotonously decreasing pattern as shown in FIG. 1B is obtained for the normal case while patterns having peaks in the intermediate regions will be obtained with variants.

[0054] The above arrangement method is similarly used for the case where the probes for multi-item testing are arranged on the same substrate. For each item, first, the full matched probe (to the normal sequence), then one-base mismatched sequences, two-base mismatched sequences and so on are placed in order of the strength of fluorescence intensity expected.

[0055] Such concept is universally applicable to any number of variation, not limited to the above method where the variation for only three bases is tested.

[0056] In addition, while we explained the cases where the signals can be obtained when the hybrids are formed, the method may be set such that signals are not obtained when the hybrids are formed, and obtained when the hybrids are not formed, depending on the signal generating system.

[0057] The sample nucleic acid can be prepared by extracting from the gene to be tested according to the necessity. The control normal nucleic acid can be synthesized on the basis of the known sequence.

[0058] The length of the probe fixed to the substrate is not limited so long as it is suitable for detection, for example, preferably 8 mer to 30 mer, more preferably 12 mer to 25 mer.

[0059] Probes may be fixed to the substrate by various methods, but droplet application by the ink jet method is preferably used in order to arrange spots of fixed probes with high efficiency, high density and high speed.

[0060] Each spot in the separate region is preferably 70 to 100 μm in diameter and spaced not to connect each other.

[0061] The spot number in each region is set so that the spots expected to have high fluorescence intensity can be measured together and the spots expected to have no fluorescence can be measured together, considering the constitution of the sensor.

[0062] A system for detecting variation of the present invention comprises a DNA array substrate wherein plural separate regions are arranged in a prescribed arrangement, and a sensor for measuring the signals in the separate regions subjected to measurement. In addition, for calling the presence or absence of variation using the DNA array substrate where the plural separate regions are provided, the signals from all separate regions may be detected and the results are used for calling the presence or absence of variation or, as a simpler method, signals may be compared between certain regions selected so as to detect the presence or absence of variation.

[0063] Computer analysis is conveniently performed connecting a computer system to the detection system. For many variants of a gene sequence, histogram patterns of signal intensity are prepared and stored in the computer, which helps the determination of variant easily and correctly.

[0064] As the sensor for signal detection, different types of photodiodes (e.g. Hamamatsu Photonics K.K.-made) are used. For example, some of the divided type silicon photodiode arrays can be used as both line sensor and area sensor. Particularly, those having a photo-receiving face of 1 to 2 mm×(2 to 3) mm are suitably used.

EXAMPLES

[0065] The present invention will be described in more detail referring to Examples below. Herein, “%” means “weight %”.

Example 1

[0066] Preparation of DNA Array for Detection of Variant Gene for Line Sensor

[0067] 1) Preparation of DNA Array Linked with 64 Types of Probes

[0068] (1) Probe Design

[0069] It is well known that in the base sequence CGGAGG corresponding to the AA248 and AA249 of the tumor suppressor gene p53, frequently observed variation is those the first C to T, the second A to G for AA248, and the third G to T for AA249. Accordingly, aiming at these three positions, 64 types of probes were designed.

[0070] That is, the designed nucleic acid are 18-mer nucleic acids harboring variegated above mentioned six bases sandwiched between the common sequences, represented by 5′ATGAACNNGAGNCCCATC3′ where N corresponds to any of 4 bases, A, G, C and T. Actual probes to detect the above sequence should be have a complementary sequence of 5′GATGGGNCTCNNGTTCAT3′.

[0071]FIG. 2 shows an arrangement of 64 types of DNA probes on a substrate. A sequence 5′ ATGAACCGGAGGCCCATC3′ corresponding to the normal gene is expected to form a hybrid with DNA of probe 42 of 5′GATGGGCCTCCGGTTCAT3′ positioned at the third from the right and the third from the top.

[0072] (2) Preparation of Substrate Introduced with Maleimide Group

[0073] Substrate Cleaning

[0074] A glass plate of 1 inch square was placed in a rack and soaked in an ultrasonic cleaning detergent overnight. Then, after 20 min of ultrasonic cleaning, the detergent was removed by washing with water. After rinsing the plate with distilled water, ultrasonic treatment was repeated in a container filled with distilled water, for additional 20 min. Then the plate was soaked in a prewarmed 1N sodium hydroxide solution for 10 min, washed with water and then distilled water.

[0075] Surface Treatment

[0076] Then the plate was soaked in an aqueous solution of 1% silane coupling agent (product of Shin-Etsu Chemical Industry: Trade name KBM 603) at a room temperature for 20 min, thereafter nitrogen gas was blown on the both sides blowing off water to dryness. The silane coupling treatment was completed by baking the plate in an oven at 120° C. for 1 hour. Subsequently, 2.7 mg of EMCS (N-(6-maleimidocaproyloxy) succinimide: Dojin Company) was weighed and dissolved in a 1:1 solution of DMSO/ethanol (final concentration: 0.3 mg/ml). The glass substrate treated with the silane coupling agent was soaked in this EMCS solution for 2 hours to react the amino group of the silane coupling agent with the succimide group of EMCS. At this stage, the maleimide group of EMCS is transferred to the glass surface. After that, the glass plate was washed with ethanol, and dried with nitrogen gas to be used for a coupling reaction with DNA.

[0077] 3. Coupling of DNA to the Substrate

[0078] Synthesis of 64 DNA Probes

[0079] The 64 types of probe DNAs shown in Table 1 each having an SH group (thiol group) at the 5′ terminus were synthesized by a standard method.

[0080] Ejection of DNA Probes

[0081] Each DNA was dissolved in water and diluted with SG Clear (aqueous solution containing 7.5% of glycerin, 7.5% of urea, 7.5% of thiodiglycol and 1% of acetylenol EH), to a final concentration of 8 μM.

[0082] Then 100 μl of this DNA solution was filled into a nozzle of a BJ printer Head BC 62 (Canon) modified to eject a small amount, and to eject six solutions per head. Two heads were used at a time so that 12 types of DNAs could be ejected at once, and the heads were changed 6 times so that 64 spots of 64 types of DNAs were formed independently on the predetermined positions. Thus obtained was a DNA array in which separate regions were arranged in a predetermined manner. The pitch of spots was 200 μm and the area formed with 8×8 spots was about 2 mm×2 mm.

[0083] After that, the plate was left standing in a humidified chamber for 30 min for linking reaction of the probe DNA to the substrate.

[0084] 2. Measurement of Fluorescence Intensity of 64 Hybrids with Normal p53 Sequence

[0085] (1) Hybridization Reaction

[0086] Blocking Reaction

[0087] After completion of the reaction, the substrate was washed with a 1 M NaCl/50 mM phosphate buffer solution (pH 7.0) to wash out thoroughly the DNA solution on the glass surface. Then, this was soaked in an aqueous solution of 2% bovine serum albumin and allowed to stand for 2 hours to carry out a blocking reaction.

[0088] Preparation of Model Sample DNA

[0089] Rhodamine labeled DNA No. 1 (SEQ ID NO: 65) of the same length as the probes but having the normal sequence of p53 gene was prepared. The sequence is shown below and rhodamine is bonded to the 5′ terminus. (Synthesis of model sample of DNA).

[0090] The labeled DNA No. 65 (single strand) having the normal sequence of p53 gene (complementary to No. 42) and the same length in the same region as the probe DNA was prepared. The sequence is as shown below where rhodamine (Rho) is bound to the 5′-terminal.

[0091] No. 65: 5′-Rho-ATGAACCGGAGGCCCATC-3′

[0092] Reaction Condition of Hybridization

[0093] Two milliliters of 50 nM DNA solution of a model sample containing 100 mM NaCl was placed into a container containing the DNA array substrate for a hybridization reaction. Initially it was heated at 70° C. for 30 min, then the temperature of the incubator was lowered to 40° C. and the reaction was continue for 3 hours.

[0094] (2) Detection

[0095] Method

[0096] The detection was performed by connecting an image analysis processing apparatus, ARGUS (a product of Hamamatsu Photonics) to a fluorescence microscope (a product of Nicon).

[0097] (3) Result

[0098] Distribution of the fluorescence quantity on the substrate obtained is shown in FIG. 3.

Example 2

[0099] Preparation of DNA Array for Detection of Variant Gene for Line Sensor

[0100] (1) Preparation of DNA Array for Detection of Variation

[0101] In Tables 2 and 3, these 64 probes are grouped in every 8 probes in order of intensity based on the above described results. The fluorescence intensity of the first group should be extremely strong, and the total fluorescence quantity of the sixth, seventh and eighth groups should be zero. TABLE 2 Group SEQ ID No. NO: Type of Mismatch 1 42 Full mismatch 58 One-base mismatch 34 One-base mismatch 41 One-base mismatch 46 One-base mismatch 10 One-base mismatch 44 One-base mismatch 43 One-base mismatch 2 26 One-base mismatch 38 One-base mismatch 50 Two-base mismatch 2 Two-base mismatch 6 Two-base mismatch 9 Two-base mismatch 11 Two-base mismatch 12 Two-base mismatch 3 14 Two-base mismatch 18 Two-base mismatch 22 Two-base mismatch 25 Two-base mismatch 27 Two-base mismatch 28 Two-base mismatch 30 Two-base mismatch 33 Two-base mismatch 4 35 Two-base mismatch 36 Two-base mismatch 37 Two-base mismatch 39 Two-base mismatch 40 Two-base mismatch 45 Two-base mismatch 47 Two-base mismatch 48 Two-base mismatch

[0102] TABLE 3 Group SEQ ID No. NO: Type of Mismatch 5 54 Two-base mismatch 57 Two-base mismatch 59 Two-base mismatch 60 Two-base mismatch 62 Two-base mismatch 1 Three-base mismatch 3 Three-base mismatch 4 Three-base mismatch 6 5 Three-base mismatch 7 Three-base mismatch 8 Three-base mismatch 13 Three-base mismatch 15 Three-base mismatch 16 Three-base mismatch 17 Three-base mismatch 19 Three-base mismatch 7 20 Three-base mismatch 21 Three-base mismatch 23 Three-base mismatch 24 Three-base mismatch 29 Three-base mismatch 31 Three-base mismatch 32 Three-base mismatch 49 Three-base mismatch 8 51 Three-base mismatch 52 Three-base mismatch 53 Three-base mismatch 55 Three-base mismatch 56 Three-base mismatch 61 Three-base mismatch 63 Three-base mismatch 64 Three-base mismatch

[0103] Then, the surface of the substrate was separated into eight columns to arrange the first group, the second group and so on in order of intensity from the left to the right. Then, the probes were arranged at the positions of the corresponding probe numbers as shown in FIG. 4 using the ink jet method in the same manner is in Example 1 to prepare the DNA arrays for detecting variation. The lines composed of each group have intervals of 200 μm.

[0104] (2) Testing Normal Gene Using DNA Array

[0105] The hybridization reaction was carried out under the same conditions as in Example 1. Thereafter, the total fluorescence of each group was detected using a line sensor (S 272102: Hamamatsu Photonics K.K.-made).

[0106] The results are shown in FIGS. 4 and 5. The fluorescence of the first group was the highest and the intensity of the second group decreases below the half of that. Fluorescence was hardly observed in the sixth, seventh and eighth groups.

Example 3

[0107] Detection of Variant Gene Using DNA Array (1)

[0108] 1. Synthesis of Model Variant DNA

[0109] The labeled DNA No. 66 having the same length as the probes and a sequence complementary to No. 46 probe that differs by one base from the normal sequence of p53 gene was prepared. The sequence is shown below. Rhodamine was bound to the 5′-terminal. The underlined part is the variant position.

[0110] No. 66: 5′-Rho-ATGAACCAGAGGCCCATC-3′ (SEQ ID NO:66)

[0111] Reaction Conditions for Hybridization

[0112] The hybridization reaction was carried out under the same conditions as in Example 1. The concentration of the model sample DNA was 50 nM.

[0113] Detection by Line Sensor

[0114] After the hybridization reaction, the DNA array was evaluated in the same manner as in Example 2 using a line sensor. As shown in FIG. 6, the fluorescence was observed in the sixth, seventh and eighth groups where the fluorescence was not observed with the normal gene. Thus, it was easily judged that the sample gene was not normal. Further, since the fluorescence intensity was high at the first and the fourth groups corresponding to probe Nos. 35 to 48, it is inferred that they have the variation in the upper right quarter of FIG. 2 with a sequence very close to the normal gene.

[0115] This result shows that the screening method for variant genes of the present invention is extremely effective.

Example 4

[0116] Detection of Variant Gene Using DNA Array (2)

[0117] The hybridization reaction was carried out under the same conditions as in Example 3, except that the concentration of the model target gene used for the hybridization reaction was changed to 5 nM. The result is shown in FIG. 7.

[0118] Since the results similar to Example 3 were obtained to show that the method of the present invention works not depending on the hybridization reaction conditions.

Example 5

[0119] Detection of Variant Gene Using DNA Array (3)

[0120] Synthesis of Variant Model Sample DNA

[0121] The labeled DNA No. 67 having the same length as the probes and a sequence complementary to No. 10 probe that differs by one base from the normal sequence of p53 gene was prepared. The sequence is as shown below where Rho represents rhodamine bound to the 5′-terminus. The underlined part is the variant position.

[0122] No. 67: 5′-Rho-ATGAACCGGAGTCCCATC-3′ (SEQ ID NO: 67)

[0123] Hybridization Reaction

[0124] The hybridization reaction was carried out under the same conditions as in Example 1 using the above variation model sample DNA. The concentration of the sample was 50 nM. The result is shown in FIG. 8.

[0125] Fluorescence was observed in the sixth and seventh groups which was not observed for the normal gene, showing that this was not a normal gene. Since the fluorescence in the second group was higher than in the first group which would be the highest for the normal gene, it is presumed that it has a variation included in the probe sequences of the second group (the upper left quarter in FIG. 2).

[0126] This result shows that the screening method for variant genes of the present invention is feasible not depending on the variation types.

Example 6

[0127] Preparation of DNA Array for Area Sensor

[0128] The probes grouped as shown in Example 2 were arranged by the ink jet method in the separate regions on the substrate as shown in FIG. 9 to prepare the DNA array for area sensor.

[0129] Then, the hybridization reaction was performed using the variant model sample used in Example 4. As a result, the fluorescence was observed in the sixth, seventh and eighth groups which was not observed for the normal gene, therefore, it was easily judged that this gene was a variant one.

1 67 1 18 DNA Artificial sequence Sample oligonucleotide 1 gatgggactc aagttcat 18 2 18 DNA Artificial sequence Sample oligonucleotide 2 gatgggactc aggttcat 18 3 18 DNA Artificial sequence Sample oligonucleotide 3 gatgggactc acgttcat 18 4 18 DNA Artificial sequence Sample oligonucleotide 4 gatgggactc atgttcat 18 5 18 DNA Artificial sequence Sample oligonucleotide 5 gatgggactc gagttcat 18 6 18 DNA Artificial sequence Sample oligonucleotide 6 gatgggactc gggttcat 18 7 18 DNA Artificial sequence Sample oligonucleotide 7 gatgggactc gcgttcat 18 8 18 DNA Artificial sequence Sample oligonucleotide 8 gatgggactc gtgttcat 18 9 18 DNA Artificial sequence Sample oligonucleotide 9 gatgggactc cagttcat 18 10 18 DNA Artificial sequence Sample oligonucleotide 10 gatgggactc cggttcat 18 11 18 DNA Artificial sequence Sample oligonucleotide 11 gatgggactc ccgttcat 18 12 18 DNA Artificial sequence Sample oligonucleotide 12 gatgggactc ctgttcat 18 13 18 DNA Artificial sequence Sample oligonucleotide 13 gatgggactc tagttcat 18 14 18 DNA Artificial sequence Sample oligonucleotide 14 gatgggactc tggttcat 18 15 18 DNA Artificial sequence Sample oligonucleotide 15 gatgggactc tcgttcat 18 16 18 DNA Artificial sequence Sample oligonucleotide 16 gatgggactc ttgttcat 18 17 18 DNA Artificial sequence Sample oligonucleotide 17 gatggggctc aagttcat 18 18 18 DNA Artificial sequence Sample oligonucleotide 18 gatggggctc aggttcat 18 19 18 DNA Artificial sequence Sample oligonucleotide 19 gatggggctc acgttcat 18 20 18 DNA Artificial sequence Sample oligonucleotide 20 gatggggctc atgttcat 18 21 18 DNA Artificial sequence Sample oligonucleotide 21 gatggggctc gagttcat 18 22 18 DNA Artificial sequence Sample oligonucleotide 22 gatggggctc gggttcat 18 23 18 DNA Artificial sequence Sample oligonucleotide 23 gatggggctc gcgttcat 18 24 18 DNA Artificial sequence Sample oligonucleotide 24 gatggggctc gtgttcat 18 25 18 DNA Artificial sequence Sample oligonucleotide 25 gatggggctc cagttcat 18 26 18 DNA Artificial sequence Sample oligonucleotide 26 gatggggctc cggttcat 18 27 18 DNA Artificial sequence Sample oligonucleotide 27 gatggggctc ccgttcat 18 28 18 DNA Artificial sequence Sample oligonucleotide 28 gatggggctc ctgttcat 18 29 18 DNA Artificial sequence Sample oligonucleotide 29 gatggggctc tagttcat 18 30 18 DNA Artificial sequence Sample oligonucleotide 30 gatggggctc tggttcat 18 31 18 DNA Artificial sequence Sample oligonucleotide 31 gatggggctc tcgttcat 18 32 18 DNA Artificial sequence Sample oligonucleotide 32 gatggggctc ttgttcat 18 33 18 DNA Artificial sequence Sample oligonucleotide 33 gatgggcctc aagttcat 18 34 18 DNA Artificial sequence Sample oligonucleotide 34 gatgggcctc aggttcat 18 35 18 DNA Artificial sequence Sample oligonucleotide 35 gatgggcctc acgttcat 18 36 18 DNA Artificial sequence Sample oligonucleotide 36 gatgggcctc atgttcat 18 37 18 DNA Artificial sequence Sample oligonucleotide 37 gatgggcctc gagttcat 18 38 18 DNA Artificial sequence Sample oligonucleotide 38 gatgggcctc gggttcat 18 39 18 DNA Artificial sequence Sample oligonucleotide 39 gatgggcctc gcgttcat 18 40 18 DNA Artificial sequence Sample oligonucleotide 40 gatgggcctc gtgttcat 18 41 18 DNA Artificial sequence Sample oligonucleotide 41 gatgggcctc cagttcat 18 42 18 DNA Artificial sequence Sample oligonucleotide 42 gatgggcctc cggttcat 18 43 18 DNA Artificial sequence Sample oligonucleotide 43 gatgggcctc ccgttcat 18 44 18 DNA Artificial sequence Sample oligonucleotide 44 gatgggcctc ctgttcat 18 45 18 DNA Artificial sequence Sample oligonucleotide 45 gatgggcctc tagttcat 18 46 18 DNA Artificial sequence Sample oligonucleotide 46 gatgggcctc tggttcat 18 47 18 DNA Artificial sequence Sample oligonucleotide 47 gatgggcctc tcgttcat 18 48 18 DNA Artificial sequence Sample oligonucleotide 48 gatgggcctc ttgttcat 18 49 18 DNA Artificial sequence Sample oligonucleotide 49 gatgggtctc aagttcat 18 50 18 DNA Artificial sequence Sample oligonucleotide 50 gatgggtctc aggttcat 18 51 18 DNA Artificial sequence Sample oligonucleotide 51 gatgggtctc acgttcat 18 52 18 DNA Artificial sequence Sample oligonucleotide 52 gatgggtctc atgttcat 18 53 18 DNA Artificial sequence Sample oligonucleotide 53 gatgggtctc gagttcat 18 54 18 DNA Artificial sequence Sample oligonucleotide 54 gatgggtctc gggttcat 18 55 18 DNA Artificial sequence Sample oligonucleotide 55 gatgggtctc gcgttcat 18 56 18 DNA Artificial sequence Sample oligonucleotide 56 gatgggtctc gtgttcat 18 57 18 DNA Artificial sequence Sample oligonucleotide 57 gatgggtctc cagttcat 18 58 18 DNA Artificial sequence Sample oligonucleotide 58 gatgggtctc cggttcat 18 59 18 DNA Artificial sequence Sample oligonucleotide 59 gatgggtctc ccgttcat 18 60 18 DNA Artificial sequence Sample oligonucleotide 60 gatgggtctc ctgttcat 18 61 18 DNA Artificial sequence Sample oligonucleotide 61 gatgggtctc tagttcat 18 62 18 DNA Artificial sequence Sample oligonucleotide 62 gatgggtctc tggttcat 18 63 18 DNA Artificial sequence Sample oligonucleotide 63 gatgggtctc tcgttcat 18 64 18 DNA Artificial sequence Sample oligonucleotide 64 gatgggtctc ttgttcat 18 65 18 DNA p53 fragment Sample oligonucleotide 65 atgaaccgga ggcccatc 18 66 18 DNA Artificial sequence Sample oligonucleotide 66 atgaaccaga ggcccatc 18 67 18 DNA Artificial sequence Sample oligonucleotide 67 atgaaccgga gtcccatc 18 

What is claimed is:
 1. A method for screening of the presence or absence of variation in a region of a nucleic acid comprising the steps of: (a) preparing a test nucleic acid corresponding to the region; (b) preparing a probe having a base sequence fully complementary to a normal sequence of the region, and a plurality of probes each having at least one base not complementary to the normal sequence; (c) fixing the probes in separate regions on a surface of a substrate to prepare a DNA array substrate; (d) reacting the test nucleic acid with the probes on the DNA array substrate; (e) measuring signals in each region totaly where the signals are originated from respective hybrids formed between the test nucleic acid and one of the probes; and (f) determining the presence or absence of mutation in the test nucleic acid comparing with a histogram pattern of signals of all regions obtained using a normal sample without variation.
 2. The method according to claim 1, wherein the signal is a light emitted from each hybrid and the total signal is measured as a total light quantity emitted from each region.
 3. The method according to claim 2, wherein the light is fluorescence.
 4. The method according to claim 2, wherein the light is a chemical luminescence.
 5. The method according to claim 1, wherein the steps (c) to (e) are: (c) preparing separated regions on a substrate by fixing probes on a surface of the substrate, wherein the separate regions comprises: a first region containing probes which provide a signal of a certain intensity on reaction with a nucleic acid having normal sequence, a second region containing probes which provide weaker signals on reaction with a nucleic acid having normal sequence, and the third region containing probes which do not form hybrids on reaction with a nucleic acid having normal sequence; (d) reacting the DNA array of step (c) with a nucleic acid having normal sequence and measuring a signal of at least one region selected from the three regions to obtain a first pattern; and reacting the DNA array of step (c) with the test nucleic acid, and measuring a signals of at least one region corresponding to the selected region of the step (d) to obtain a second pattern; and (e) determining the presence or absence of variation in the test nucleic acid by comparing the first and second patterns.
 6. The method according to claim 5, wherein the selected region is the first region giving a strongest total signal and/or the third region giving no or a weakest signal on reaction with a nucleic acid having normal sequence.
 7. The method according to claim 5, wherein the separate regions are arranged on the substrate in order of signal intensity obtainable by reacting with a nucleic acid having normal sequence, from the highest intensity to the lowest intensity along a direction of a detection.
 8. The method according to claim 5, wherein the selected region is the third region, and when a total signal is detected with the test nucleic acid in the step (d), variation is called positive, and the test nucleic acid is determined to have variation.
 9. The method according to claim 5, wherein the first region contains probes consisting of a probe having a fully complementary sequence to the normal sequence and probes having one-base mismatch to the normal sequence. When reacting with a normal base sequence of a nucleic acid.
 10. The method according to claim 5, wherein the selected regions are both of the first and the third region and determining the presence or absence of variation comparing the ratio of the intensity of the third region to that of the first region.
 11. The method according to claim 5, wherein the selected regions are all of the region, and determined the presence of absence of variation comparing the histogram pattern of signal intensity.
 12. The method according to claim 5, wherein detection of the total signal is performed by an area sensor.
 13. The method according to claim 7, wherein detection of the total signal is performed by a line sensor.
 14. The method according to claim 1, wherein a base length of the probes is 8 mer to 30 mer.
 15. The method according to claim 14, wherein the base length of the probes is 12 mer to 25 mer.
 16. A DNA array substrate for screening a variation in a region of a nucleic acid, wherein a full match probe fully complementary to a normal sequence of the region, and a plurality of mismatch probes having at least one base mismatch to the sequence are arranged on the substrate; and the probes are arranged to form at least two separate regions selected from: a first region containing at least one probe which provides a signal of a certain intensity on reaction with a nucleic acid having the normal sequence, a second region containing at least one probe which provides a weaker signal than the probe of the first region on reaction with a nucleic acid having normal sequence, and the third region containing at least one probe which provides no signal on reaction with a nucleic acid having normal sequence.
 17. The DNA array substrate according to claim 16, wherein the signal is fluorescence.
 18. The DNA array substrate according to claim 16, wherein the signal is chemical luminescence.
 19. The DNA array substrate according to claim 16, wherein the first region contains the full match probe and the mismatch probes having one mismatch base. When reacting with a normal base sequence of a nucleic acid.
 20. The DNA array substrate according to claim 16, wherein the separate regions are arranged on the substrate in order of total signal intensity obtainable by reacting with a nucleic acid having normal sequence, from a highest intensity to a lowest intensity along a direction of a detection.
 21. The DNA array substrate according to claim 16, wherein a length of the probes is 8 mer to 30 mer.
 22. The DNA array substrate according to claim 21, wherein the length of the probes is 12 mer to 25 mer.
 23. A system for detecting variation comprising a DNA array substrate according to claim 16 and a signal measuring apparatus which measures signals from separate regions of the DNA array substrate. 